A Flexible Approach for a Fault-Tolerant Router

نویسندگان

  • Andreas C. Döring
  • Wolfgang Obelöer
  • Gunther Lustig
  • Erik Maehle
چکیده

Cluster systems gain more and more importance as a platform for parallel computing. In this area the power of the system is strongly coupled with the performance of the network, which has to provide high bandwidth and low latency. Besides these performance aspects fault-tolerance within the network is very important. This paper shows how to build a flexible and faulttolerant router, the main building part of a network. In addition the overhead for the execution of fault-tolerant routing algorithms is examined.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CAFT: Cost-aware and Fault-tolerant routing algorithm in 2D mesh Network-on-Chip

By increasing, the complexity of chips and the need to integrating more components into a chip has made network –on- chip known as an important infrastructure for network communications on the system, and is a good alternative to traditional ways and using the bus. By increasing the density of chips, the possibility of failure in the chip network increases and providing correction and fault tol...

متن کامل

Design of an Active Approach for Detection, Estimation and Short-Circuit Stator Fault Tolerant Control in Induction Motors

Three phase induction motors have many applications in industries. Consequently, detecting and estimating the fault and compensate it in a way that the faulty induction motor satisfies the predefined goals are important issues. One of the most common faults in induction motors is the short circuit of the stator winding. In this paper, an active fault-tolerant control system is designed and pres...

متن کامل

An edge router based protocol for fault tolerant handling of advance reservations

To provide quality of service guarantees, resource reservation schemes have to maintain reservation states at the nodes along the path of a flow. Advance reservation schemes have to maintain these states for a long period of time. The loss of reservation state caused by node failures, makes advance reservation schemes highly susceptible to node failures. In this paper, we argue that a domain-by...

متن کامل

Cost-Efficient Fault-Tolerant Router Design for 2D-Mesh Based Chip Multiprocessor Systems

In this paper, a cost-efficient fault-tolerant router design, called 20-Path Router (20PR) architecture, is proposed to reduce the impacts of faulty routers for 2D-mesh based chip multiprocessor systems. The 20PR consists of two fault-tolerant circuits: 1) a Builtin Self-Test and Self-Diagnosis (BIST/SD) circuit to detect and locate faulty FIFOs and MUXs, and 2) a Fault-Isolation (FI) circuit t...

متن کامل

Fault-Tolerant Communication with Partitioned Dimension-Order Routers with Complex Faults

ÐThe current fault-tolerant routing methods require extensive changes to practical routers such as the Cray T3D's dimension-order router to handle faults. In this paper, we propose methods to handle faults in multicomputers with dimension-order routers with simple changes to router structure and logic. Our techniques can be applied to current implementations in which the router is partitioned i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998